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Description 

FIELD OF THE INVENTION 

5 [0001] The present invention relate In general to the processing of digitized pictures and in particular to picture 
coders, wherein output data streams must have a constant bit-rate (CRB) to satisfy the requisites of the transmission 
systems and/or of recording systems The invention is particularly useful in video coders based on the MPEG2 protocol 
that generates a variable bit-rate (VBR) output stream and that require an output buffer in which restoring the required 
constant bit-rate by appropriate regulation, but also in coders based on different protocols such as MPEG2, to MPEG1 . 

10 H263andH261. 

BACKGROUND OF THE iNVENTION 

[0002] A precise control of the number of bits per Group-Of-Prctures (GOP) is very important when recording the 
75 coded video sequence on an appropriate support (for example a CD-ROM) according to the MPEG2 standard. Indeed, 
it permits simple play back operations such as fast-fbnvard and fast-backward tiianks to the fact that the GOP are reg- 
ularly ^3aced. 

[0003] Only few pictures of the sequence are visualized during these reproducing operations. In particular, only the 
intra picture (I) are decoded, tiie only ones allowing a random access because they are independentiy coded from the 
20 rest of the sequence. Therefore, tiie reproduction consists in searching witiiin the bit-stream and decocfing of tiie first 
frame (I) of each GOP. Such a s^rch may take place by a jump approximately dose to the start of tiie GOP and tiien 
by reading the bit-sta-eam looking for the start code of tiie picture. Alternatively, tiiere is tiie possibility to insert a limited 
nuni)er of stuffing bits in order to make exactly constant tiie number of bits per GOP and thus eliminating the searching 
in the bit-stream. 

S5 [0004] Anotiier advantage is tiie simplification of pichjre editing. Indeed, it is possible to take a snnall portion of film, 
rnodify it record it and insert it witti precision tiie same bit-sti'eam interval. Witiiout such a possibility it would be neces- 
sary to read and save the entire bit-stream in order to assign tiie necessary space to tiie modified sequence. 
[0005] The objectives of any CBR system for oontiiolling tiie brt-rate are: 

30 1) having preliminarly fixed the desired (target) bit-rate and taking into consideration pictures of a certain com- 
pletely (detail), determining the coding quality and ttierefore the appropriate parameters for regulating tiie coding 
itself in such a way to obtain tiie predefined t>it-rate; 

2) maintaining through a contirol process, a local quality as uniform as possble witiiin the sequence. 

35 [0006] Of course, the ability of accurately control tfie number of bits produced by each frame has repercussions 
also upon the quality of the coded sequence. Indeed, when the control system assigns the target bit-<ate for the frame 
to be coded, it also distributes the remaining bits in the GOP among the frames tiiat are still to be coded. If tiie bits pro- 
duced by tiie frame coding are greater in number ttian tiie predicted ones, in tiie GOP will remain less bits than previ- 
ously envisaged. Thereby, control system will have to diminish the bits at disposal of tiie successive frames somehow 

40 penalizing ttieir quality. These problems occur mainly when there are substantial changes of the picture statistics (and 
tiierefore of tiieir conplexity). This typically happens upon changes of scene. The two objectives mentioned above are 
then conflfoting. 

[0007] In the ensuing descnpti'on, reference will be made to tiie following publications on these topics and their per- 
tinent contents recalled as needed: / 

45 

♦ [1] International Standard ISO^IEC 13818 (MPEG2), "bifbrmation Technology - Generic coding of moving pictures 
and associated audio". Mar. 1994. 

♦ [2] B. G. Haskell, A. Purl, A. N. Netravali. "Digital video: an introduction to MPEG2", Chapman & Hall. ISBN 0-412- 
08411-2. 

50 4 [3] Test Model Editing Committee. '*MPEG2 Video Test Model 5". ISO/lEC JTC1/SC29AA^G1 1 (April 1 993). 

♦ [4] G. Keesman, I.. Shah, R. Klein-Gunnewiek, ^'Bit-rate comrol for MPEG encoders'. Signal Processing: Image 
Communication 6 (1995) pp. 545-560. 

♦ [51 W. Ding, B. Liu, "Rate Control of MPEG video coding and recording by Rate-Quantization modeling", IEEE 
Transactions on Circuits and Systems for Video Technology, vol. 6. no. 1. Feb. 1996. 

ss 4 [6] S. Bilato, G. Calvagno. G.A. Mian. R. Rinaldo. "Accurate Bit-Rate and quality Control for the MPEQ2 video 
coder", proceedings of ICIP-97 (international Conference on Image Processing). Santa Barbara 1997. 

[0008] The artide by Keesman [4] describes the finding of a conpromise between tiie two contrasting objectives by 
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introducing a technique based on defining a so-called Bit Usage Profile which allows to predict how the bits should be 
distributed within a picture. Such a technk|ue is also described by Mian [6]. ; 

[0009] In the majority of video coders there is a control signal which directly influences the regulation of the local 
quality of a picture. Such signal will be symbolized by Qj (where Q stands for quantizer). A high value of Q/.con-esponds 
5 to a low quality of the image (coarse quantization) while a moderate value corresponds to a higher viewing quality (fine 
quantization). 

[001 0] The objective of keeping a uniform quafity is then equivalent to having a substantially constant control signal 
Qj, within the picture and possibly throughout the entire sequence of pictures. 

[0011] According to the system or model desaibed in the article "MPEG2 Video Test Model 5" [3], briefly refen-ed 
10 to as TTVI5. and also according to other CBR models cited in the bibliography, as for example in [5|, the control of the bit- 
rate is typically realized in three phases: 

1) estimation of the global target bit-rate (Global Target bit allocation), wherein a prediction of the number of bits 
required for each picture of the sequence is generated. Of course the complexity of the picture and the state of fill- 

15 ing of the output data buffer must be considered; 

2) determining the reference value Qj[n] of the local quantization parameter (Local Control) for the n-th macrob- 
lock (MBn) that satisfies previously estimated target bit-rate; 

3) calculating the "effective quantization" parameter mquant[n] for each macroblock [1], generating it in function of 
the filling state of the output buffer and of the activity of the macroblock itself (Adaptive Quantization). 

20 

[001 2] While according to the reference model TM5 [3] the latter function is considered a substantial part of the bit- 
rate control, according to other known systems [4], [5] and [6], as well as in the context of the system of the present 
invention, is considered a separate problem concerning the conversion of the control signal Qi[n] m the regulating 
parameters of the mquant[n] encoder, acoording to the formula: 

25 

30 where act[n] is a measure of the detail of the picture based on the variance of the n-th macroblocK while avg^act is the 
average value of act[n] on the whole picture. The expression within the brackets is often referred to as the normalized 
activity Nact[f], 

[001 3] Therefore, in the ensuing description, the Adaptive Quantization will not be taken into consideration because 
it is a separate function, and it is not directly involved in the algorithm of the present invention. 
35 [0014] Both the global target estimation and the local control use Information on how to distrtoute the bits among 
the different pictures of tiie sequence and within the single pictures. These information may be derived from the knowl- 
edge of tiie previously coded pictures (according to the TM5 [3]). or by means of a modeling of tine Rate-Quantization 
cunre (according to the system described in [5]), or derive from a pre-analysis carried out on each picture (as in [4] and 
[6]). 

40 [001 5] In tiie system described by Mian [6], a pre-analysis is carried out to calculate the entropy of tiie DCT coeffi- 
cients, after having approximated tiie probability density function by way of histograms. By establishing a certain quan- 
tizer O. tiie entropy Hn of tiie n-tii quantized DCT coefficient, is calculated as: 



45 ^ 

/fti = - 5] • (log2 

where Pt^ ^ indicates the probability to c^tain the K-th symbol at tiie output of the quantization. Therefore tiie average 
50 entropy represents tiie actual bit-rate R: 



[0016] The estimates otitained wHh the statistical model, when the quantizer O* for each macroblock is fixed, are 
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in accordance with the performances of an MPEG2 encoder, wherein Q* represents the average value of the quantiz- 
ers effectively used [6]. 

[0017] It should be noted that with this method, only global estimate (that is on a whole picture) of the actual bit- 
rate R may be obtained, becai^e the histograms require a large number of samples to be statistically valid. Hence, a 
5 possible local level estimate of the single n-th macroblpck is lost. 

[0018] The pre-analysis with pre-codng, according to the method described by Keesman [4], constitutes a first 
example of reference and comparison and is hereinbeiow reported as a basis for subsequently describing the method 
of tiie invention and point out the differencea 

10 CONTROL OF THE BIT-RATE WITH PICTURE PRE-ANALYSIS 

[0019] The scheme of Fig. 1 highlights the following elements of the functional coding blocks: 

a) an MPEG2 encoder implementing a hybrid scheme of DOT interframe coding, according to what described in 

75 [II and [2]; 

b) an adaptive quantization block input with the control signal Qj[nJ and which outputs the effective quantizer 
parameter mquant[n]; 

c) a block computing tiie estimated global number of bits (Global Taiiget bit allocation), using information derived 
from previously coded pictures and using Information derived from a pre-analysfs of tiie cun'ent picture; . 

20 d) a block computing tiie Bit Usage Prof De (BUP), using information derived from the above-mentioned pre-anal- 
ysis: 

e) a proportional-integrative local controller (briefly P-l which obtains the right Qj signal that will produce tiie 
required bit-rate. 

2S [0020] This bit-rate control scheme presents some important differences compared to the TM5 model: 

1) in the computation of tiie global target a difference is made between overhead data and tiie data relative to tiie 
coding of tiie DCT coeffidents; (by the expression overhead bits we refer to all tiie txts not used for coding the OCT 
coefficients, such as the coding data of motion vectors, slice address, type of macroblock, address of the succes- 

30 sive macroblocK type of quantization employed and the like); 

2) while in tiie TM5 model a P controller (proportional) is used, in tiie system of this Invention a P-i controller (Pro- 
portional-integrative) is used; 

3) the P-l controller of the bit-rate employs a so-called technique Bit Usage ProfOe prediction, reducing the number 
of control actions. 



GLOBAL TARGET ALLOCATION 

[0021 ] Let us assume tiiat tiie complexity of pictures of tiie same type be constant in time (stationarity property) and 
to have a number R of bits available for a GOP. The calculation of tiie target bit-rate is carried out before a true coding 
40 resen/ing a sufficient number of bits for the remaining pkAjres belonging to tiie considered GOP. 

[0022] Therefore, if S[n] represents the required number of bits to code the n-tfi picture and if /c pk^ires of the cur- 
rent GOP have already been coded, ttiere will be tfien Rl bits for coding the remaining N-k pictures of the GOP in ques- 
tion. By calling T[n] tiie estimated number of bits (target) for a cunent picture, then; . 



[0023] Let us assume that tiie number of bits per picture necessary for the encoding be equal for all tiie pictures of 
the same type, in accordance to the Bbove mentioned hypothesis of stationarity; this assumption is reasonable if we 
consider the coding of pictures of similar contents from the point of view of their complexity, for example kjy. assuming 
55 that there are not changes of scene. 

[0024] The targets for I, P or B pictures are indicated witii 77. Tp, Tb, respectively The number of I, P and B pictures 
inn the GOP tfiat remain coded is indicated witti M, Np and Nb, respectively. In line with the above assunrptiore: 



35 



45 




(I) 



so 
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RI^Nh Ti+Np • Tp-k-Nb •Tb . (2) 

[0025] The coding of a picture yields two types of data: the bits used to code the DOT coefficients (Huffman coding) 
and the bits used to code the so-called overhead information (that is all that is not a DOT coefficient). In the following 
5 paragraphs we shall conventionally use the symbol to identify the different types of coding for I, P or B frames. The 
overhead data of a genericpicture, denoted by O*, are generally independent of the average contra! signal Qm, which 
is considered the average of mquantfn] over a picture, while the number of bits for coding the OCT coefficients, denoted 
by C*. depends on the quantization parameters. TTierefore: 

10 r*=c*+o* (3) 

[0026] The nunt)er of bits required for the DOT coefficients dqsends not only on the average control signal Qm but 
also on the content of the picture, a measure which may be the complexity X*. Similarly to the T1VI5 technique, we 
assume the validity of the following relation for each type of picture: 



so 



X*^C**Qm (4) 



[0027] The complexity X* may be obtained either ftom previously coded pictures of the same type of the current 
one or through a pre-analysis, which consists in preceding with a fixed and constant Q/n/. In any case tiie conplexities 
20 Xi\ Xp and Xb of the different pictures may be assumed known. 

[0028] It is assumed to strive to obtain a uniform quality in time tyy choosing, relatively for tiie different type of pic- 
tures, average values for the control signal with constant ratios: 

K7p*0/=Q3 (5) 

2S 

Kpb'C^'^Qb 

wtiere Q\ Qp and Qb are the average values Qm relative to tiie I. P and B pictures. The constants Kip and Kpb estab- 
lish the scaling factor of the global complexities used In calculating the global targets. In the original article of Mian, et 
30 al., Kip^LO and /Cpt>=7.0 were selected, while in the TM5 model K7p=7.0and Kpb =1.4 are used. 

[0029] At this point it is possible to derive the equations for the global targets. Firstly, estimates of the 01, Op and 
Ob values are obtained from tiie pre-analysis. It is also possible to estimate the overhead from the data of tiie last frame 
encoded in the same mariner of the current one. 

[0030] The estimated values for the variables C* are derived by solving a system of equations. For darity of nota- 

35 tion the variable RL is introduced: 

RL^RI-[Nh Oi+Np- Op+Nb • Ob)=Ni* Ci+Np- Cp+Nb • Cb (6) 

which denotes the number of bits left for tiie DOT coefficients without any the overhead data. 
40 [0031] At this stage, there are six equations: the relationship (4) which represents three equations, tiie two equa- 
tions (5) £Uid the equation (6), and six unknown variables C/, Cp, Cb, Qi and Qb. The values of M. Np and Nb and the 
complexity values X* are known from the pre-analysis or taken from previous encodings. The flowing global target 
equations for the DOT data can be derived by simple algebraical operations: 

^ a ^^iBk ^ m 

^ NhXi-^Np^Xp/Kip-^Nb^XbUKip'Kpb) ^ ' 



Kip-Kpb'Xi 



55 [0032] From equations (7) it may be observed tiat tiie global targets distribute the bits among tiie pictures accord- 
ing to the relative complexities. Abrupt changes of scene are a problem for the computation model of the global targets. 
In fact up to here the global complexities are partially obtained from pictures that have been already coded in the same 
manner. For the B or P pictures, such a prediction does not create a problem even in the case of an abrupt change of 
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cx3ntent of the pictures of the sequence. However, for the I pictures there may be some problems because the distance 
among them may be at 12 frames. The pre-analysis may give more reliable information on the global complexities, in 
presence of changes of scene. In any case, after a change of scene there is a short period of time in which the human 
sight is less sensible to the bluffing effect of the pictures. This phenomenon allows the use of global targets lower than 
5 what would be really necessary and causes the generation of coding errors in pictures Immediately following a change 
of scene. 

[0033] The Mian algorithm [6] employs a similar procedure of GloiDal Target calculation and therefore leads to the 
same solutions expressed by the equations (7). 

10 LOCAL CONTROL 

[0034] The algorithm for estimating the global targets obtains, as explained above, the average value Qm of the dif- 
ferent mquant[n] and the average number of bits per macroblock. If too many bits are used for a portion of the picture 
the Qj[n] of the macrot^ocks of the remaining portions of the picture must be incremented accordingly in order to obtain - 
15 a k»t-rate of the picture as conpliant as possible with the global target, at the expense of the uniformity of the quality. 
[0035] For each macroblock. the local control computes the relative control signal Qj[n] using Information on the 
maacblocks already encoded and information on the bit usage. The model is thoroughly analyzed in the article [4]"^ 
which contains an analysis of the stability based on the Theory of Automatfo Controls. 

[0036] Rg. 2 shows a block diagram of the local control. A unit of time is taken the interval between the encoding 
20 of two successive macroblod^s. 

[0037] Therefore, the n-th instant indicates the encoding Instant of the n-th macroblock of the cun^ent picture. The 
variables used in this analysis, functions of time n, and indicated in the scheme of Rg. 2 are the following: 

Qj[n] is the value of the reference quantization ^^rameter for the n-th macroblock (or briefly MB); 
25 • s[n] is the number of bits obtained upon effectively coding of the n-th macrot^ock; 

• b[n] is the effective filling state of the output buffer; 

• p[n] is the estimated number of coding bits for the macroblocK deriving from the information produced at the end 
of the pre-analysis; 

t[n] Is the estimated filling state of the t^iffer (that is the target state); 
^30 • r[n] is the number of transmitted bits (eliminated from the output buffer) in the time interval of the n-th maaoblock. 
It should be noted that the term r[n] appears twice In the scheme: it is subtracted from the number of bits s[n] pro- 
duced to calculate the real filling stage of tiie buffer b[n] and Is also subtracted from the estimated number of bits 
b[n] to reach the estimated filling state t[n] of the buffer; 

e[n} is the error between the estimated and effective filling state of the buffer, that is: 

35 

e[nMn]-b[n] (8) 
[0038] The fonctional blocks of the scheme of Rg. 2 are: 

40 • PI; the Proportional-lntegrative local control block; it selects the value of Qj[n] depending on the detected error; 

• MPEG ENCODER: the encoding block; it represents the means that transform the selected value Qj[n] and the 
obtained number of bits s{n] and also the means of adaptive quantization. 

• BUFFER: a data accumulator (memory buffer); its input x and output y are linked by the formula y[n} = yln-IJ-hxln], 
which, in terms of the Zeta Transform, becomes: 

45 

/(2)=-^.X(z) 

so [0039] E(z), B(z), T(z) and P(z) are the Zeta Transforms of e[nl b[n], t[n] and p[n], respectively. The error signal 
E(z) is in the difference-between the buffer content B(z) and the desired content T(z). The latter signal is obtained by 
integrating the bit profile P(z) which represents the expected number of bits per maaoblock. According to the TM5 
model, this number is constant within a picture, that is. it is a hypotized uniform spread of bits anmng all the macrob- 
lock& 

55 [0040] The local controller, whose architecture is shown in Fig. 3. obtains the Qj(z) signal from the em3r signal E(z), 
The controller is of the proportionally-integrative (P-l) type, having a proportional path with a KP gain and an Integrative 
path with a Kf gain. Rom preliminary tests and from tiie indications present In [4]: KP=0.01 and Kl=^,001. 
[0041] The following equation links the quantization parameter Qj(z) to the E(z) error: 
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QKz)=^KP*KI-^^'E{z) 
s (P042] Using the inveree transform and t>y recursivety proceeding to the 0 instant, having placed e[0]=0: 
fi^M = 0H£F +KPe[n]+Kl'£eiu) (10) 



10 



15 



20 



35 



SO 



(9) 



where QJref 's the constant reference quantization parameter for the picture that is used In the pre-analysis and which 
may be calculated in different ways. Such parameter is the average of the Qj[n} of the preceding picture (regardless of 
the type), that is: 



[00431 It should be borne in mind that Qm and QIref^^ the average values of mquant[n] and Qj[n}, respectively, 
for the whole picture, and that they are linked by the average value of the normalized activity Naci[i]. 
[0044] The choice of a proportional-integrative (P-l) controller provides for a regulation without static enror while a 
25 proportional (P) controller (such as in the TM5 model and also adopted by Mian [6]) is affected by the static enror com- 
ponent 

[0045] A compromise is necessary in selecting the controller parameters Kl and KP in order to achieve both the 
stated objectives. If a low gain is set in the control loop, for example a KP dose to zero, there is a loss of control effec- 
tiveness, in such a case the quality of the picture may be very uniform but the bit-rate will be hardly predictable. An 
30 encoder with a relaxed control of the used bits may incur into problems every time there are overflow or underflow situ- 
ations. 

[0046] On the other hand, if a high gain is set, for example a KP dose to 1 , in order to Implement a higher control 
of the bit-rate, the quality of the picture may become rather inconstant. In order 1o solve this problem, the Bit Usage Pro- 
file is used. 



THE BIT USAGE PROFILE 



[0047] The objective is to detennine in advance the number of bits t[n] that will be necessary for each macrobfock. 
this approach is referred to as Bit Usage Profile. To this end, a pre-analysis is used (see Keesman [4]) according to 
40 which a complete encoding of the cun-ent picture is done using a constant Qj[n] value. Thence, the number of bits s[n] 
that each macroblock uses during this preceding process is obtained. 

[0048] Sintilarly to what is done for the global complexiV X*clfa pkrture. a local complexity of a macroblock is 

defined as: 

4S x[nl»s[/7]-Qy[n] (12) 

with the following condition: 



^*=Z4«] (13) 



[0049] The Bit Usage Profile (BUP) of each macrobfock is indicated by t[n]\ it represents the estimated number of 
55 bits that each macroblock will use for effective encoding, with the condition that the sum of the various t[nj be equal to 
the global target T*, that is: 
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(14) 



10 



[0050] By redistributing the t[n} bits among the various macroblocks of the picture, depending on their complexity 
x/n/. the Ibilowing equation is derived: 

m^n^JEi^ (15) 

xln] X* Lx[n\ ^ ' 



15 [0051] Upon assunning that during precoding the Qj[n] is constant (for example equal to 0//|£/r). for the various 
macroblocks it is obtained: - 



20 



UUP tln\ T^s[n\^Qm * s[nV Qj s[n\ , . 

[0052] It should be noted that the TM5 model assumes a uniform bit distribution among the different macroblocte. 
that is 



[0053] A symbolic level, equation (10), which describes the local contrd model, remains unchanged and va«d, 

30 although the value tin] of equation (8) is in reality the BUP desaflaed by equation (16). 

[0054] TTie algorithms described in [4] and [6] produce an oplinral quality and an accurate control of the bit-rate. 
However, their main application is substantially restricted to the professional mari«t sector, where procesang delays of 
a certain entity may be tolerated and wherein costly memories may be used eventually in order to resynchronize the 
video and audio packets. Nevertheless, the pre-analysis inevitaWy lengthens the processing time of the video stream. 

35 On the other hand, the consumer market sector imposes the use of the least amount of memory space in ordw to limit 
costs. 

OBJECT AND SUMMARY OF TtiE INVENTION 

40 [0055] Objective of this invention was to find a Constant Bit-Rate (CBR) control method for hybrid DPCM-DCT 
video coding systems in general (MPEG1, H.263. H.261) and more in particular for coding systems according to the 
MPEG2 standard, which would irnply a reduced memory capacity requisite and therefore reduced hardware costs, with- 
out notably loss of the quality of the transfenred pictures, as compared to known control systems. 
[0056] The method of the invention is able to keep constant the output bit-rate that is produced at the end of the 

46 coding process, with a high accuracy, by using a mix of information deriving from the effective encoding of preceding 
pictures, from the pre-analysis of preceding pictures and from a pre-analysis advantageously executed on a certain 
group of slices or even on a single sfice (GOS) of the current picture; a slice being constituted by at least a whole line 
of macroblocks of a picture. 

[0057] According to preferred embodiments, a GOS pre-analysis may be altematively canied out 

so 

a) by precoding one or more lines of macroblocks (sGce), with a constant reference quantizer; 

b) by way of a statistical calculation (with histograms) of the entropy of the DCT coefficients that approximates the 
average characteristic waveform of the bits necessary to encode the DCT coefficients. 

55 [0058] As compared to a CBR controller of the prior art using a pre-analysis of the whole current picture, the system 
of the invention provides for an outstandingly accurate control while presenring a good quaility of the pictures compara- 
ble to those of the known systems, but with a reduced processing delay and a reduced memory requisite for managing 
pre-analysis data. 
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[0059] By conducting a pre-analysis on a cun-ent GOS rather than on the whole cunent picture, it is necessary to 
store Information limited to the cun^ent GOS, thus reducing the memory requisite, the area occupied on the.silicon and 
the overall cost of the device. 

[0060] Even, the time of pre-analysis of a GOS rather than of the whole picture, is much shorter thus facilitating syn- 
s chronization among the audio and video packets. 

[0061] Surprisingly. It has been found that by operating a pre-analysis on a current slice or group of slices of length 
equal to the width of the picture and in a maximum number of four slices (GOS) at the time, it is possOsle to retain a 
picture quality that Is indistinguishable from that of the known systems. 

[0062] This Is obtained by defining a distribution within each slice of the target bits as a reference for the calculation 
10 of the quantization parameter Qj at the local control level. This technique is referred to as Target Bit Usage Profile 

(TBUP) and constitutes an essential aspect of the novel algorithm of the invention as defined in the appended claims. 

[0063] According to ttie method of the invention, not having pre-analysis data of the whole current picture, but just 

those of a GOS thereof, a mix of the data of the pre-analysis of the GOS of the preceding picture (regardless of its type 

I, P or B) and of data of the effective encoding of tiie preceding picture (regardless of its type I, P or B), is effected in 
15 order to implement said TBUP technique acoording to tiie two alternative pre-analysis mettiods, mentioned in the above 

paragraphs a) and b). The requested memory capacity to store the above mentioned data is in any case negligible, 

being such data related to the number of slices or GOS contained in one whole picture. 

[0064] Finally, in order to Implement a tight bit-rate control while presennng a substantially unifbmi quality of the pic- 
tures, the Bit Usage Profile (BUP) of each macroblock representing the estimated number of bits that each macroUock 
20 will be using for the effective encoding is computed. 

[0065] In case of a pre-analysis according to the paragraph a) mode, involving the precoding of a slice or multislice 
GOS of tiie cun'ent picture, ttie BUP is obtained by determining in advance tiie number of bits necessary for each mac- 
roblock through the pre-analysis Hself. 

[0066] In case of a pre-analysis according to tiie paragraph b) mode, involving the calculation of the entropy on a 
25 slice or multislice GOS of tiie current picture, the BUP cannot be obtained from ttie pre-analysis itself ttiat is only capa- 
ble to produce an estimate limited to tiie GOS being processed and not for each macrot)lock; however, even for this sec- 
ond embodiment, ttie BUP may be obtained by determining the number of bits necessary for each macroblock from ttie 
information stored during ttie effective coding of tiie GOS of ttie preceding picture. 

30 BRIEF DEgCRIPTIQN OF THE DRAWINQ3 

[0067] 

Rgure 1 is a functional block diagram of a GBR control algorittim witti pre-analysis of ttie whole pk;ture, according 
35 to the known technique, as discussed above. 

Rgure 2 is an architectural block diagram of the CBR control function of an MPEG encoder. 
Figure 3 is an architectural scheme of a proportional-integrative local contrdler. 

Rgure 4 is a functional block cOagram of a CBR contix)! algorithm witii pre-anatysis of a GOS, of the present inven- 
tion. 

40 Rgure 5 is a flow chart of an embodiment of tiie invention (K.sl.1 ), based on a pre-analysis with precoding of one 
slice of the current picture to implement ttie BUR and based on tiie use of data resulting from tiie effective encoding 
of the preceding picture, of ttie same type of ttie current one for implementing the TBUP 
Rgure 6 is a flow chart of an embodiment of tiie invention (K.sl.3). based on a pre-analysis with precoding of one 
slice of the cunent picture to implement the BUP, and based on ttie use of data resulting from tiie pre-analysis witti 

45 preceding of ttie preceding picture, of the same type of the current one for implementing ttie TBUR 

Rgure 7 is a flow chart of an embodiment of the Inverition (qL^t), based on a pre-anatysis witti entropy compu- 
tation on a GOS of tiie current picture for calculating ttie bits target of a slice within the above mentioned GOS 
{Tslice), and based on using the data resulting from ttie effective encoding of the preceding picture of tiie same 
type for implementing ttie TBUP, and finally based on a BUP of a simply uniform distrOsution. 

50 Rgure 8 is a flow chart of an embodiment oft he invention (prof j[)rec), based on a pre-analysis with entropy cal- 
culation on a GOS of tiie cun'ent picture for calculating ttie bit target of a slice within tiie above mentioned GOS 
(Fs/Zce). and based on using data resulting from the effective encoding of preceding picture of ttie same type for 
implementing both the TBUP and tiie BUR 

Rgure 9 is a flow chart of an embodiment of tiie invention (prof^med). based on using data resulting from the 
55 effective coding of the preceding picture of ttie same type for implementing both tiie TBUP and ttie BUP. and based 
on the average of data from the effective encoding of ttie preceding picture and from the pre-analysis and entropy 
calculation on a GOS of the cunent pkrture, for calculating ttie bit targets of a slice within the GOS (7s//ce). 
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DESCRIPTION OF SEVERAL EMBODIMENTS OF THE INVENTION 

[0068] By referring to Rg. 4, the ensuing description of the system of the invention follows a scheme to the one 
used for describing the known system of Fig. 2, The differences between the system of the invention and the known 
5 system will be punctually highlighted. 

GLOBAL TARGET BIT AII.0CAT10N 

[0069] From the method of the invention the relationships expressed by way of the above equations (3). (4), (5), (6), 
10 and (7), still hold. 

[0070] According to the invention, by canning out a pre-analysis on a GOS, at the beginning of the actual encoding. 
Infonnation of the overhead data for the whole picture Is not available. 

[0071] What is done is a comparison between the number of overhead bits estimated by the pre-analysis on the 
first GOS multiplied by the number of GOS (which means to consider constant the overhead on each GOS) and the . 
75 overhead calculated in the pre-analysis of preceding pictures of the same type. The lower of these two values is used 
for calculating the global target, thus underestimating the number of not DOT bits estimated for the remaining pictures 
of the GOP and producing a less stringent global target for the current frame. 

[0072] For the first P and B pictures of each GOP, excluding the first, the overhead is calculated as the average of 
the overheads of the preceding GOP so to have a more realistic estimate given the lack of Information at the beginning 
20 of a new GOR'Since the I picture is not subject to motion compensation, it is characterized by a smaller nuntf^er of over- 
heads bits than the otiier types of pictures (P or B) and rt is then treated separately by considering the overhead of tie 
corresponding 1 picture of tiie preceding GOP. 

[0073] The same applies also for the complexity X* (see equation 4), which may no longer be obtained from tiie 
pre-analysi s, because at the end of the process! ng of the first GOS there is not enough information on tfie whole picture. 
25 However, the complexity X* is obtained by considering tiie DOT bits of tiie pictures of the same type, previously 
encoded. 

BIT USAGE PROFILE 

30 [0074] As already explained above, the BUP permits a tight control of the bit-rate while preserving uniformity of the 
quality of the pictures. According to the method of the invention, the BUP is obtained by determining in advance tiie 
number of bits that wfll be needed for each macroblock t[n] ttirough a pre-analysis on tiie slice that is about to be 
encoded. 

[0075] The pre-analysis consists in pre-coding all the macroblocks of the current GOS keeping the quantization 
35 parameter Qj constant, similariy to what is done in Keesman [4]. In this way, an estimate of the number of bits used by 
each macroblock s[n] is produced. 

[0076] By considering the restraint that fhe sum of the various t[n] be equal to the global target of each slice, Tslice, 
the following formula is obtained for calculating the BUP of each macroblock: 



where Zs[n] is equal to the sum of tiie slice bits obtained from the pre-analysis. 

45 [0077] in relation to tiie equation (1 6), It is employed a global target for tiie current slice ( Tslice) rather ttian for the 
whole picture (7>). A first way to detemiine it is to consider tiie estimated bits uniformly distributed on tiie different 
slices, so taking as a reference the uniform distribution of bits among tiie various macroblocks. This type of solution may 
deviate from reality when, for instance, a part of the picture requires more bits tiian another one. 
[0078] According to the invention, tiiis drawback is effectively overcome by Introducing a Target Bit Usage Profile, 

so as better explained In tiie ensuing desaiption. 

[0079] Rnally, it should be noted that for tiie case of a pre-analysis with calculation of the entropy, as per the 
approach used by Mian [6], the s[n] may be obtainKl only by knowing the bits effectively used in encoding the preceding 
pictures (regardless of the type I, P or B), because the pre-analysis with calculation of histograms generates only global 
estimates for the GOS ratiier than for the individual n-th macroblock. 
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TARGET BIT USAGE PROFILE IN AN APPLICATION WITH THE PRE-ANALYSIS FOLLOWING A KEESMAN 
APPROACH 

[0080] By considering uniform the distribution of the target bit on the slices, we experimentally obtain a decrement 
5 of the PSNR as compared to an algorithm with pre-analysis of the whole picture and a worsening of the enor on a single 
macroblock. 

[0081 ] These consequences are effectively remedied by defining a Target Bit Usage Prof fle (TBUT), which repre- 
sents an estimate of the target bit profile among the different slices and which is evaluated in the pre-analysis phase or 
derived from the encoding of preceding pictures. 
10 [0082] At the end of the encoding process or of the pre-analysis of the preceding picture, the numb& of total bits 
used by the encoder for each GOS Sqos and the number of total bits for each slice of the GOS being processed. 
Ss//ce. are stored. By considering for example the j-th GOS (containing at the most 4 slic^): 



15 



Scos^^Sslice[k] (18) 



NtrniMn 



20 Spree = Y^Sco^k] 



where NumMax is the maximum number of GOS in a picture and Spree represents the total nunftber of bits for the whole 
2S pictur& 

[O083] The target for the n-th GOS, representing the Target Bit Usage Profile (TBUP). is given by: 

TBUP.T^nlJ-^>r (19) 

30 

with T* equal to the target of the whole picture, found at the beginning of the encooOng process by way of the Target Bit 
Allocation technique. Of course, the sum of all the Tqqq in a picture is equal to T*. 

[0084] If the GOS contains only one slice: Ts//ce= Tqq^ and Sslice=SQQs . In this case the BUP is explicitly given 
35 by equation (1 7) rewritten below as equation (20): 

aaP=f[nl=7s/ice-^ (20) 

40 

where as usual Ss[n] is equal to the sum of the slice bits obtained by the pre-analysis with precoding of the cunrent slice 
of the ojrrent picture. 

[0085] Instead. If the GOS contains more slices (4 at the most), the BUP is more accurate because it tends to 
become more similar to the BUP of the whole picture (as would be the case if the GOS contains all the slices of a pic- 
46 ture), thus equation (1 7) becomes: 

BUP.tln]^T^s^^^ (21) 

so 

where in this case Ss[n] is equal to the sum of the GOS bits derived from the pre-analysis with precoding of the current 

GOS of the cun'ent picture. 

[O086] It should be noted that the values of Sqos ^^lice may be obtained in different ways depending on 
whether a knowledge of the bits calculated during the pre-analysis of the preceding pictures or during their effective 
55 encoding is used, in both cases regardless of their type: 1. P or B. 
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TARGET BIT USAGE PROFILE IN AN APPLICATION WITH PRE-ANALYSIS ACCORDING TO MIAN 

[0087] At the end of the coding or of the pre-analyas of the preceding picture, the number of total bits Sqos pro- 
duced by the encoder for each GOS. and the nuniber of total bits Sslices per each slice of the GOS being processed 
are stored witti the conditions that, for the j-th GOS (containing at nnost four slices): 



Scos=-^Sslice[k] (22) 

10 A.| 



f/umMax 



15 



20 



30 



40 



Spree = ^ Sgo^'] 



where NumMax Is the maximum number of GOS in a picture, while Spree represents the total number of bits used for 
the preceding whole picture, either encoded or pre-analyzed. 

[0088] The target for the n-th GOS. representing the Target Bit Usage Proffle (TBUP). is given by: 



Spree 



TBUP^ TQos[n]= • ^* (23) 



2ff where 7> is equal to the target for the whole picture, found at the beginning of the encoding process by way of the Glo- 
bal Target Bit Allocation; of course, the sum of all the Tqqs in a picture must be equal to 7*. 
[0039] In calculating Tslice, only data derived from pre-analysis with entropy calculation of the current GOS of the 
cun^ent picture, tfiat is: 



Tslk:eW=T^UV^^^ (24) 



where Eqqs represents the number of total bits of the j-th GOS being processed, estinrated by way of the entropy, while 
35 EslicelUlk is the number of estimated bits of the single k-th slice, with the condition that, for the j-tti GOS. it is: 



Ems[j] = £ EsliceUm (25) 



[0090] The above equation (24) is not necessary in a pre-analysis conducted according to Keesman, because if tiie 
GOS contains only one slice, Tslioe is calculated with equation (20) othemvise Tqqs is calculated with equation (21). 
45 [0091 ] Rnally equation (1 7) which calculates the BUP may be transformed in the IbllGwing expression: 

BUP^m^Tslice^^^ (26) 

50 

where, in this case. s[nlsin6 Es[n] are obtained from the effective coding of the corresponding GOS of the preceding 
picture, liecause information on tiie current picture is not available. 

[0092] Fundamentally, equations (20) (Keesman) and (26) (Mian) are alike, the only difference being that s[n} and 
S$[n] are derived from pre-analysis with precoding (equation (20)) of the cun^ent GOS (containing only one slice) of the 
55 current picture, otherwise from the effective encodng data of tiie corresponding GOS (including more than one slice) 
of tiie preceding picture (equation 26). ' 
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REQUIRED MEMORY SIZE FOR PRE-ANALYSIS DATA 

[0093] As already mentioned, the fact of carrying out a pre-analysis only on the GOS involves a large reduction of 
the memory that is necessary for storing ttie data. Indeed, by assuming to use a 32 bit precision arithmetic architecture 

5 (of the "float" type according to the computer language C), the memory occupation in case of pre-analysis on a four 
slice GOS typically may be 4 x 32 tut 45 maaoblocks per slice, that is, a total of 5.760 bits for any significant information 
(OCT bits, overhead bits, total bits). By contrast, in case of pre-analysis conducted on a whole picture, according to a 
prior art system, there the memory occupation would be 32 bit x 1 ,620 macroblocks. that is 51 .840 bits for significant 
information. This implies a saving of 46.080 bits, or of afcwut 88%. ft the GOS contains only one slice, the memory sav- 

10 Ing rises to 97%. 

[0094] The memory capacity necessary to store the Sslice information, deriving from a pre-analysis of effective 
coding (of the preceding picture), is negligible, being it equal to the only number of slices in a picture (there are 36 slices 
in a PAL picture of 576 active lines), that is 32 bit x 36 slices « 1 ,152 bits. 

15 LOCAL CONTROL 

[0095] The controller used for a local control is of the integrativeiaroportional type. Once the combined processi ngs 
according to the known Bit Usage Profile technique (alternatively according to one of the equations (20). (21) and (26)) 
and according to the technique of the present invention that introduces the Target Bit Usage Profile (alternatively 
20 according to one of equations (19). (23) and eventually also (24) for a Mian connpliant version), have been completed, 
in order to f ind the target number of bits for each macrd>lock t[nl it is possible to determine the Qj control parameters 
at the local level (equation (10)). whidi may then be used, through an adaptive quantizati<»i. for calculating the real 
quantization parameter mquant. 

[0096] Symbolically, equation (10), which describes the local control model, remains valid and unchanged, with the 
25 only difference that only the t[n] of equation (8) is in reality the BUR that is alternatively determined by the equations 
(20), (21) or (26). 

[0097] Of course, for calculating the local en-or of each first macroblock of a slice, it is necessary to store in tiie 
memory tiie datum con-esponding to tiie target number of bits found for tiie last macroblock of the preceding slice. In 
tiiis way, it is avoided tiiat tiie slices be not considered as separate items so that tiie transition from one to the otiier may 
30 result visible in tiie encoded picture. 

EXPERIMENTAL MEASUREMENTS 

[0098] This section reports the experimental results of different implementations of the method of the invention of 
35 GOS pre-analysis. As samples of video sequences have been used the so-called Calendar, Stefan and Flowers 
sequences, coded at 4 Mbps (Megabit per second) and at 10 Mbps. 

[0099] It is reminded tiiat not having available pre-analysis data of a whole current picture but only tiiose of a GOS 
(which may include four slices at tiie most), a mix is done of ttie pre-analysis data of tiie preceding picture (regardless 
of its type I. P or B) and/or of tiie data deriving from ttie effective coding of the preceding picture (regardless of its type 
40 I, P or B), in applying ttie equations (19) and (23) (TBUP). according to an embodiment of the method of tiie invention 
compliant with the known method of Keesman and/or of Mian. 

[0100] However, to apply equation (26) (BUP), in a Mian compliant embodiment only data originating from coding 
tiie preceding picture may be used. By contoast ttie equations (20) and (21), according to an embodiment compliant 
witti the mettiod of Keesman. require data derived from ttie pre-analysis of tiie current GOS of ttie current picture. 
45 [0101] Finally, according to an embodiment compliant witti ttie mettiod of Mian, to implement equation (24) for 
Tsiice, only data originating from ttie pre-analysis of tiie coding of the cun-ent picture may be used. 

PRE-ANALYSIS WITH PRECODING OF ONLY ONE SLICE 

50 [0102] With regard to the pre-anatysis witti a preooding, a pre-analysis on a GOS witti only one slice has been sim- 
ulated. The embodiments of ttie algorithm tested were ttie fbltowing: 

1) K.sl.1 : Kpb^tO ttie TPUB (Sqqs and Spree of equation (19)) is derived from the coding bits of the immediately 
precedent picture of ttie same type (I. P. B). as shown in Fig. 5. 
55 2) K,s\2: Kpb=1.0 and the TPUB (Sqqs and Spree of equation (19)) derived from ttie bits of ttie pre-analysis ttie 
immediately preceding picture, regardless of its type. 

3) K.sl.3: Kpb=tO and the TPUB (Sqqs ^prec of equation (19)) derived from the pre-analysis bits of the 
immediately preceding picture of ttie same type (I. P or B). as shown in Rg. 6. 



13 



EP1005 233A1 



4) K.sl.4: Kpb=1.0 and the TPUB (Sqqs and Spree of equation (19)) derived from the pre-anaJysis bits of the 
immediately preceding picture, regardless of its type if P or B. if of type I the pictures were separately considered 
by refening to the picture I of the preceding GOR 

5) K.sl.5: same as K.sl.3, the difference being that during the Target Bit Allocation phase to the I pictures is 
5 assigned the estimated number of bits incremented by 20% while the one to the P pictures is incremented by 5%, 

to produce a somewhat less accurate bit-rate control but ari enhanced uniformity of the quality of the pictures. 

6) K.sl.6: same as ICsl.4. however /Cp& = 1.4 instead Q(tKpb= 1.0. 

7) K.sl.7: same as K.sl.3. however Kpb = 1.4 instead of Kpb = 1.0. 

10 [0103] For all these test embodiments, the pre-analysis on the current GOS of the current picture is only used for 
the applying equation (20). that is to estimate s[n] and Zsin]. 

[0104] Table 1 compares the performance of the slice algorithms of the invention with that of the method of Kees- 
man [4] and of the TM5 model [3]. for the Calendar sequence coded at 4 Mbp& The figures of interest are: 

15 • the average percentage error for the GOP (Err GOP), calculated as the difference between the Target Bits of a sin- 
gle GOP and the effective number of bits used In encoding, averaged among all the GOPs of the sequence; 

• average percentage error among the target bits and those effectively used, for each picture; averaged for the whole 
sequence (Err. SEQ); 

• the picture PSNR averaged on the whde sequence; 

20 • the PSNR of each single macroblock. averaged on all the pictures of the sequence. This measure takes into a 
greater consideration the artifacts at the macroblock level, which would be inevitably hidden by the intrinsic averag- 
ing of the PSNR parameter, that traditionally is calculated on the whole picture; 

• the mquant average for ail the macroUocks of the sequence; 

• the mean deviation of the mquant (for the entire sequence) in respect to the above average value. 

[0105] It may be noted that the TM5 model gives the highest PSNR, though to the detriment of a less accurate con- 
trol of the bit-rate (average error of 1 .61% on the GOP and of 0. 1 9% on the sequence). The Keesman algorithm [4], with 
the pre-analysis of the whole cunent picture, provides a very accurate control (average error of 0.0028% on the GOP 
and of 0.00006% on the sequence), better by least 3 to 4 orders of magnitude than the TM5. By contrast, the different 
30 inplementations of the method of the invention of slice pre-analysis, loose at most one order of magnitude in tenns of 
the precision over a sequence but ensure a definitively more accurate control than the TM5 model. Differences of tiie 
average PSNR among macroblocks are even smaller. Table 1 reports these results. 



45 



50 



Algorithm 


Error GOP (%) 


Error SEQ (%) 


PSNRaverage 
on frames 


PSNRaverage 
on MBs 


Mquant aver- 
age on 
sequence 


Mean devia- 
tion on 
sequence 


TM5 


1.61 


0.19 


27.88 


30.78 


32.0 


5.34 


Keesman 


0.0028 


0.00006 


27.03 


29.99 


32.7 


2.0 


K.sl.1 


0.01 


0.0009 


26.3 


29.7 


37.1 


3.55 


K.81.2 


0.004 


0.00008 


26.97 


30.0 


34.1 


2.7 


ICsL3 


0.003 


0.0003 


26.95 


29.96 


33.6 


2.4 


K.sl.4 


0.003 


0.0002 


26.92 


29.94 


33.9 


2.5 


K.sl.5 


0.007 


0.06 


27.46 


30.47 


32.8 


3.8 


K.sl.6 


0.01 


0.00018 


27.64 


30.73 


34.8 


2.5 


K.sl.7 


0.02- 


0.0008 


27.8 


30.8 


34.0 


6.7 



[0106] With regard to the measures of the quality of the pictures in terms of PSNR. the Keesman algorithm looses 
55 about 0.8 dB as compared to the TM5 model. However, such a degradation is considered negligible according to sub- 
jective tests (the PSNR is an objective measure tiiat becomes well correlated to visual evaluations starting from differ- 
ences above 0.8-1.0 dB). The PSNR differences between the various test embodiments of tfie method of tfie invention, 
based on a slice pre-analysis, and the method of Keesman are much smaller ttian said range and often tiie conrparison 



14 



EP1005 233A1 



is In favor of the slice pre-analysis method of the Invention. 

[0107] The K.sl.3 algorithm may be considered the best, and for such a preferred Innplementation the results 
obtained for the Stefan and Flowers sequences (coded at 4 Mbps) are also reported in the following table. 



Table 2 



Sec|uence/atg 
orithm 


En^orGOP 


En^orSEQ(%) 


PSNR average 
on frames 


PSNR average 
on MBs 


Mquant aver- 
age on 
sequence 


Mean devia- 
tion on 
sequence 


StefaiVTMS 


2.02 


0.12 


30.0 


33.4 


. 29.9 


7,6 


Stefon/Kees- 
man 


0.006 


0.0006 


29.6 


33.1 


29.7 


5.9 


Stelan/K.sl.3 


0.01 


0.00008 


29.3 


32.9 


33.0 


7.7 


Flowers/TM5 


1.03 


0.007 


28.33 


32.72 


32.1 


6.25 


Rowers/Kees- 
man 


0.0014 


0.0003 


27.59 


31.96 


32.49 


2.49 


Rowers/K.sl.3 


0.002 


0.0005 


27.39 


31.84 


33.2 


2.7 



[01 08] As far as quantization is concerned, the above tables report also the average value of the mquant of all the 
macroblocks of the sequences, as an index of a spatial quality; a high value meaning a strong quantization and a con- 
sequent blurring of the picture, while a low value of maquant mearting a finer quantization and thereby a minimum loss 
of picture content. 

[0109] From this point of view, it is evident that the TM5 model, having a less accurate bit-rate oontrol, has average 
values consistently lower than the other algorithms with preceding. Nevertheless, the mean deviation of TM5 is higher 
(with the only exception for the Stefan sequence) than In the algorithms witii precoding. This means that the local quality 
of the algorithm with precoding Is more uniform over the whole picture. 

[0110] Such an effect may compensate the slight Increment of the average mquant Indeed, It may be better to 

watch pictures of uniform quality, even if slightly more quantized, rather than pictures with zones of significant quantiza- 
tion difference (of nonuniform quality). The optimal performances of the K.sl.3 version of the algorithm of the invention 
compared to tiiose obtained with precoding of the whole picture, should be noted. Relatively to mean deviation, even 
witii a limited information derived form the pre-analysis of a single slice, a mean deviation very dose to the one tfiat m^ 
be obtained with the pre-analysis of tiie whole picture is obtained. 

[0111] Finally, the IbDowing table reports these measures on the sequences coded at 10 Mpbs. 



Table 3 



Sequence/alg 
orilhm 


EnrorOOP 


EnorSEQ(%) 


PSNR average 
on frames 


PSNRaverage 
on MBs 


Mquant aver- 
age on 
sequence 


Mean devia- 
tion on 
sequerice 


Calendar/TM5 


0.54 


0.1 


33.2 


35.6 


13.4 


2.22 


Calen- 

dar/Keesman 


0.0003 


0.00009 


32.36 


34.88 


13.9 


1.2 


Calen- 
dar/K.ls.3 


0.0006 


0.00007 


32.36 


. 34.95 


14.0 


1.3 


StefannM5 


0.3 


0.05 


35.73 


37.84 


11.69 


2.53 


Stefan/Kees- 
man 


0.0003 


. 0.0001 


35.74 


37.8 


11.5 


2.0 


Stefem {K.sl.3) 


0.001 


0.00008 


35.42 


37.62 


. 12.6 


2.9 


Rowers (TM5) 


0.2 


0.03 


33.7 


37.27 


13.43 


2.43 
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Table 3 (continued) 



Sequence/alg 
orithm 


Error GOP 
(%) 


Error SEQ(%) 


PSNR average 
on frames 


PSNRaverage 
on mds 


Mquant aver- 
age on 
sequence 


Meandevia- 

uon on 
sequence 


Flowers (K 
orig.) 


0.002 


0.00006 


33.45 


36.79 


13.4 


1.5 


Flowers 
(K.sl.3) 


0.002 


0.0001 


33.36 


36.8 


13.6 


1.6 



[0112] By increasing the bit-rate, the average values of PSNR and maquant become very similar for all the three 
methods compared; namely: T1\/I5, preceding of a complete picture and preceding of a single slice (invention). The 
accuracy of the control is def initely better in preceding systems. 

15 

THE PROBLEM OF SCENE CHANGES 

(Oil 3] In Older to test the behavior of the different methods during changes of scene, some tests simulations were 
can'ied out on a sequence (Puzzle) of 50 pictures coded at 4 Mbps, artificially generated by copying frames belonging 
20 to different sequencer In particular, the pictures 1 to 7 were taken from the Calendar sequence. 8 to 21 from a Tennis 
Table sequence, 22 to 23 from Edit. 34 to 38 from Voitur and 39 to 50 from Poppl& 



Table 4 



Algorithm 


Error GOP (%) 


Error SEQ (%) 


PSNRaverage 
enframes 


PSNRaverage 
onMBs 


Mquant aver- 
age on 
sequence 


Mean devia- 
tion on 
sequence 


TM5 


3.85 


0.35 


29.6 


31.6 


30.16 


8.89 


Keesman 


0.002 


0.00005 


29,14 


31.2 


30.41 


8.42 


K.sl.3 


0.01 


0.0008 


28.65 


31.0 


33.79 


8.9 



[0114] From the results of Table 4. it may be noted that said how the bit-rate control is more accurate for the two 
35 pre-analysis algorithms as compared to the TM5 one. The slight increase of the en^r arei the PSNR with the pre-anal- 
ysis algorithm of the Invention involving only a single slice (K.sl.3), in respect to the one with predecoding of the whole 
picture (Keesman). is due to the fact that despite the exptoilation of information derived from pre-analysis of the current 
slice of the current picture, some parameters, such as the complexity and the overhead, are inevitably calculated by 
refem'ng to past pictures. Instead, the complete picture pre-analysis algorithm of the prior art calculates all the needed 
40 quantities for the current picture before its actual encoding, thus better determining an Intervening change of scene. 
[Oil 5] To improve even further the control, in a method according to the present invention, it is possible to use addi- 
tional infonnation deriving it from the motion estimation block, so as to improve the ability to recognizing changes d 
scene. Accordingly, the picture corresponding to a change of scene may be coded as an l-pidure. obviously if the bit- 
rate control system has a sufficient number of bits available. The capacity to can^ out a replay with fast-fonward or with 
45 fast-rewind, based on a fixed dimension of GOP (e.g. 12 pictures), is not lost anyway because tiiere will always be at 
least one l-ptcture every 12 pictures (in a GOP of a fixed size). 

PRE-ANALYSIS BASED ON THE ENTROPY CALCULATION ON MORE THAN ONE SLICE 

50 [0116] in tiie case of an alternative embodiment of tiie method of tiie invention, wherein the pre-analysis is carried 
out by way of a calculation of the entropy, a pre-analysis on a GOS witii 1 . 2. 3 and/or 4 slices was hypothized. The 
results relate to tiie following variants tested with a 4-slice GOS. 

1) QL^tat the reference Qj of the whole picture is assumed computed from tiie pre-analysis with entropy calcula- 
55 tion on tiie first GOS of the cun^ent frame. The sum of the estimated bits per slice {Eqos anci Estice of equation 
(24)) is connputed by way of a pre-analysis of the GOS of the current picture. Instead of using tiie BUP of equation 
(26), a uniform bit distribution witiiin tiie single slice is assumed. The TBUP of equation (23) exploits tiie information 
SQos^nd Spree relative to tiie effective encoding of tiie preceding picture, as illustrated in Fig. 7. 
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2) Prof_prec: the reference Qj is calculated as in Qj^stat The bit sum per slice (Eqqs and Eslice of equation (24)). 
is calculated by way of a pre-analysis of the GOS of the current picture. The BUP {s[n] and ls[n] of equation (26)) 
is derived from the effective bits of the previously coded picture. The TBUP of equation (23) exploits the information 
Sqos ^P^^ relative to the effective encoding of the preceding picture, as illustrated in Fig. 8. 
5 3) Prof_medio: the reference Qj is calculated as in Qjjstat. The bit sum per slice is obtained by averaging two val- 
ues, the first value is obtained by exploiting the pre^alysis of the cun-ent GOS of the current picture, the second 
value is obtained from the effective bits of the previously coded picture. The BUP is calculated as in Prof j)rec. 
Rnally a Kpb = 1.4 Is used instead of Kpb = 1 as used in the two preceding test embodiments. The algorithm is 
illustrated in Fig. 9. 

10 

[01 17] By coding the sequences at 4 Mbps, the results shown in the following table are obtained. It appears evident 
a close comparison between the TM5 model and the control methods that use a pre-analysis with calculation of the 
entropy either earned out on the whole current picture, as in. the known Keesman approach, or on single GOS according 
to the present invention. 

15 



l^eS 



Sequence/ialgorHhm 


EnrorGOP(%) 


Error SEQ (%) 


PSNR average on 
frames 


PSNR average on MBs 


Ca!endar/TlVl5 


1.6126 


0.1337 


27.88 


30.78 


Calendar/QLstat 


0.0276 


0.00803 


26.95 


29.61 


Calendar/Prof j)rec 


0.0736 


0.00700 


26.21 


28.94 


Calendar/Prof.medio 


0.0509 


0.00715 


27.08 


29.94 


FlowersAl^S 


1.6568 


0.20034 


28.36 


32.72 


Ftowers/QLstat 


0.0706 


0.00258 


27,63 


31.57 


Flowers/Profj)rec 


0.2142 


0.00404 


26.70 


31.04 


Flowers/Prof.medio 


0.2712 


0.00567 


27.68 


31.91 



[01 18] It should be pointed out that the performances of the algorithm of the invention with pre-analysis carried out 
35 by way of the calculation of the entropy, are approximately midway between those of the TM5 model and those of with 
preceding. This is so because the entropy is just an approximation measure of the effective bit-rate. The pre-analysis 
with a precoding is more reliable in calculating the bit-rate. In any case, the control through a calculation of the entropy 
is simpler to implement, because it is sufficient to produce the histograms of the DCT coefficients rather than performing 
a precoding. 

40 [01 1 9] By coding the sequences at 1 0 Mbps, the following results are obtained. 



Table 6 



Sequence/algorithm 


Error GOP (%) 


Error SEQ (%) 


PSNR average on 
frames 


PSNR average on MBs 


Calendar/TM5 


0.67 


0.12333 


32.51 


34.99 


Calendar/QLstat 


0.0288 


0.00726 


31.86 


34.16 


Calendar/Prof jDrec 


0.0102 


0.00186 


30.49 


33.47 


Calendar/Prof.medio 


0.0045 


0.00313 


31.76 


34.38 


Flower8niVI5 


0.9623 


0.17519 


33.05 


36.72 


FIcwers/QLstat 


0.0142 


0.00183 


32.9 


35.94 


Flowers/Prof jprec 


0.0058 


0.00041 


32.05 


35.53 


Flowers/Prof^medio 


0.0071 


0.00023 


32.58 


36.09 
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[0120] In this case, it may be noticed that by increasing the bit-rate, the average PSNR values are very similar for 
all the compared methods, however the accuracy of the control is definitively better for pre-analysis systems. 
[0121 ] In particuian by considering th^ results at 4 Mbps and at 1 0 Mbps, the Prof.medio anbodiment of the algo- 
rithm of the invention shows the best performance overall. 

s 

Claims 

1. A method of controlling constancy of the bit-rate at a certain value (CBR). depending on optimization criteria in 
transferring a data stream of encoded picture sequences, comprising the steps of 

10 

a) estimating (Global Target Bit Allocation) the number of bits necessary for encocfing a cunrent picture and the 
successive pictures belonging to the same group of pictures or GOP of a sequence^ based on a target number 
of (T*) coding bits for each picture of a sequence, in function of encoding data (0«», X*, Q*) of pictures pre- 
ceding the current one and/or on the results of a pre-analysis; 
IS b) detenmining (Local Control) the ref^'ence value (QJ[n]) of the quantization parameter at a local level for the 

n-th macroblock of data, that complies with the limit established by said estimated number of bits for the n-th 
macroblock (/fn/): 

c) calculating (Adaptive Quantization) of an effective quantization parameter {maquant[n]) for each macroblock 
in function of the filling state of an output buffer of said data stream of coded data and of an activity parameter 

20 of the macroblock (Nact) : 

d) calculating a distribution profile of target bits over a picture (Bit Usage Profile or BUR), by way of a pre-anal- 
ysis of a certain number of lines of ntacroblocks or slices (GOS) of the current picture, distributing the allocated 
bits in function of the local complexity {x[n]) of the single n-th macroblock and of the number of bits {s[nj) effec- 
tively used in said pre-analysis; 

25 e) calculating a distribution profile of target bits (Target Bit Usage Profile or TBUP) over said plurality of lines of 

macroblock or slices (GOS) given by the ratio between among the estimated number of bits required for the 
encoding (Sqos* Spree) obtained by way of a pre-analysis of the current GOS of the cunrent picture or from 
the effective coding of the corresponding GOS of the preceding picture, multiplied by said target number of bit 
(7*) for of ttie whole picture; 

30 said pre-analysis of the current GOS being carried out by preceding the GOS using a constant reference quan- 

tization parameter. 

2. The method according to claim 1 . characterized in that in step d) (Bit Usage Profile or BUP), the effective encoding 
of said certain number of whole lines of macroblocks or slices (GOS) of the preceding picture is used for distributing 

35 the allocated bits in function of the local complexity of tiie n-th single macroblock {x[n]) and of the number of bits 
effectively used during said effective ooding (s[n]), and in that after step e) the method Includes the following step 

f) intermediately computing the distribution profile of tiie target bit among tiie different lines of macroblocks or 
slices as tiie ratio between enti-opy values (Eqqs* ^slice). derived from a pre-analysis of the current GOS of 
40 tiie current picture, muiti'plied by said target number of bit usage profile (TBUP); 

said pre-analysis of the current GOS being earned out by way of the calculation of tiie entropy on ttie histo- 
grams of the OCT ooefficiafits off the whole GOS. 

3. A video encoder conprising means for controlling tiie constancy of the bit-rate at a certain value (CBR), depending 
45 on optimization criteria in transferring a data stream of encoded picture sequences, said means comprising an out- 
put buffer, an integrative-proportional controller input witti the state of fDling of said output buffer and outputting a 
bit-stream which is transmitted ttirough a transmission channel and a control signal for varying the value of tiie ref- 
erence quantization parameter for an n-tti macroblock of data by said encoder, characterized in tiiat said controller 
uses estimated data of a certain number of bits {t[n]) necessary for encoding a current picture based on a target 

so number of bits ( T*) for each picture of a sequence determined function of pre-analysis data of a certain number of 
lines of macroblocks. or slices (GOS) of tiie curent picture (s/n/, Es//ce, Eqos) and of the data of an effective cod- 
ing or of pre-analysis data of a certain number of lines of maaoblocks or slices (GOS) of the preceding picture 
(Spree, Sqos)* such a pre-analysis being undertaken by precoding said number of slices (GOS) using a constant 
reference quantization parameter or by a statistical computation of the entropy of the discrete cosine transform 

55 (OCT) coefficients of the macroblocks of said lines of macrobbcks or slices. 

4. The decoder according to claim 3. characterized in that it comprises a memory for storing said pre-analysis data of 
said number of lines of macroblocks or slices (GOS) of tiie cun-ent picture {s[n], Eslice, Eqos) ^ 
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effective coding or of a pre-analysis of a certain number of lines of macroUocks or slices of the preceding picture 
(Spree, Sqos). said pre-analysis being carried out either tfirough a preceding of said number of slices (GOS) using 
a constant reference quantization parameter or by estimating the entropy of the discrete cosine transform (DCT) 
coefficients of the macroblocks of said lines or slices (GOS).. 
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